Abstract: We consider a broadcast channel with 3 receivers
and 2 messages $(M_0, M_1)$ where two of the three receivers need
to decode both messages $(M_0, M_1)$ while the remaining one just needs
to decode the message $M_0$. We study the best known inner
and outer bounds under this setting, in an attempt to find
the deficiencies of the current techniques for establishing the
bounds. We produce a simple example where we are able to
explicitly evaluate the inner bound and show that it differs from
the general outer bound. For a class of channels where the general
inner and outer bounds differ, we use a new argument to show
that the inner bound is tight.

I. Introduction 

The broadcast channel with degraded message sets was
initially studied by Körner and Marton [1] for two receivers
and more recently in [2], [3], [4] for three or more receivers.
Körner and Marton [1] established the capacity region for
degraded message sets with two receivers, and some capacity
regions for three or more receivers were established in [2], [3] by
showing that the straightforward extension of the inner bound
in [1] is optimal. In [4], an idea called indirect decoding was
introduced and the authors showed that it can be used to
enhance (in some cases strictly) the straightforward extension
of the inner bound by Körner and Marton. Unfortunately, the
new inner bounds of [4] become unwieldy due to
the introduction of many auxiliary random variables. However,
there is still one class of broadcast channels with degraded
message sets where indirect decoding does not
yield any region better than the straightforward extension of
the Körner and Marton inner bound, and this is the scenario
of interest here.

Consider a 3-receiver broadcast channel with 2 messages
$(M_0, M_1)$ and the following decoding requirement: receivers
$Y_1$ and $Y_2$ need to decode both messages $(M_0, M_1)$, while
receiver $Y_3$ needs to decode only message $M_0$. The traditional
inner and outer bounds presented below remain the best known
bounds for this class of broadcast channels. In
this paper we look at the general inner and outer bounds for
this scenario in greater detail. We show that these bounds
differ in general, and that there is a class of channels where
the inner bound is tight and the outer bound is weak.

There are two main contributions in this paper: the first
is the technique (in the same spirit as Mrs. Gerber's lemma [5])
used to evaluate the boundary of a particular inner bound;
the second is the use of a $(\frac{1}{2} + \epsilon)$-codebook$^1$ rather than an
$\epsilon$-codebook to establish the capacity region.

Bound 1: The union of the set of rate pairs
$(R_0, R_1)$ satisfying

$$R_0 \le I(U; Y_3)$$
$$R_1 \le \min\{I(X; Y_1|U), I(X; Y_2|U)\}$$
$$R_0 + R_1 \le \min\{I(X; Y_1), I(X; Y_2)\}$$

over all pairs of random variables $(U, X)$ such that $U \to X \to
(Y_1, Y_2, Y_3)$ forms a Markov chain constitutes an inner bound
to the capacity region.

Bound 2: The union over the set of rate pairs $(R_0, R_1)$
satisfying

$$R_0 \le \min\{I(U_1; Y_3), I(U_2; Y_3)\}$$
$$R_0 + R_1 \le \min\{I(U_1; Y_3) + I(X; Y_1|U_1),\; I(U_2; Y_3) + I(X; Y_2|U_2)\}$$
$$R_0 + R_1 \le \min\{I(X; Y_1), I(X; Y_2)\}$$

over all possible choices of random variables $(U_1, U_2, X)$ such
that $(U_1, U_2) \to X \to (Y_1, Y_2, Y_3)$ forms a Markov chain
constitutes an outer bound for this channel.

The above bounds are traditional, i.e. can be obtained using 
standard techniques. The inner bound is a straightforward 
extension of the achievability argument in [1] and the outer 
bound can be deduced by arguments in [6], [7], etc. 

Remark 1: It is also possible to include the constraint

$$R_0 \le \min\{I(U_1; Y_1), I(U_2; Y_2)\}$$

in the outer bound. However, it is straightforward to
show that the region obtained by adding this inequality is
identical to the bound we presented.

These bounds are known to be tight in all of the following
special cases:

• Receiver $Y_1$ is a less noisy receiver than $Y_3$ and $Y_2$ is a
less noisy receiver than $Y_3$ [2], [4],

• $Y_3$ is a deterministic function of $X$,

• $Y_1$ is a more capable receiver than $Y_2$ (or vice versa),

• $Y_3$ is a more capable receiver than $Y_2$ (or $Y_1$).

$^1$ An $\eta$-codebook denotes a codebook whose probability of error
is bounded above by $\eta$.



The last two cases are straightforward and the proof is
omitted. When $Y_3$ is a deterministic function of $X$,
it is not difficult to show that the convex closure of
the regions obtained by setting (i) $U = Y_3$ and (ii) $U = \emptyset$ in
the inner bound exhausts the following region,

$$R_0 \le H(Y_3)$$
$$R_0 + R_1 \le \min\{I(X; Y_1), I(X; Y_2)\}$$

and this clearly forms an outer bound to the capacity region.

One class of channels that does not fall into any of these cases
is the channel shown in Figure 1 below. The channel
$X \to (Y_1, Y_2)$ represents a binary skew-symmetric (BSSC)
broadcast channel [7], [8] and the channel $X \to Y_3$ represents
a binary symmetric channel (BSC) with crossover probability $p$, with
$0 \le p \le \frac{1}{2}$.

Fig. 1. 3-receiver broadcast channel



In the next section we evaluate Bound 1 for this channel.
Based on the symmetry, it is natural to believe that the
auxiliary channel $U \to X$ must be a BSC with some crossover
probability $s$. In the next section, we prove that this is indeed
the case. This uses a technique similar in spirit to Wyner and
Ziv's technique of using Mrs. Gerber's lemma [5]. We will
also show that Bound 2 yields a strictly larger region for
this channel. Finally, we will show that the region represented
by Bound 1 constitutes the capacity region for this channel.

II. Evaluation of the inner bound 

In the evaluation of the inner bound, we divide the range
$0 \le p \le \frac{1}{2}$ into two regions, $0 \le p \le p_{\max}$ and
$p_{\max} \le p \le \frac{1}{2}$, where $p_{\max} \in [0, \frac{1}{2}]$ is the unique solution of

$$1 - h(p) = h(\tfrac{1}{4}) - \tfrac{1}{2};$$

i.e. the value of $p$ at which the capacity of the BSC matches the
term $\max_{p(x)} \min\{I(X; Y_1), I(X; Y_2)\}$. Numerically,
$p_{\max} \approx 0.184$.
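The value of $p_{\max}$ can be reproduced numerically. The following is a minimal sketch using bisection; the function names `h` and `solve_p_max` are illustrative choices and not part of the paper.

```python
import math

def h(x):
    """Binary entropy in bits, with h(0) = h(1) = 0."""
    if x <= 0.0 or x >= 1.0:
        return 0.0
    return -x * math.log2(x) - (1 - x) * math.log2(1 - x)

def solve_p_max(tol=1e-12):
    """Bisection for the root of 1 - h(p) = h(1/4) - 1/2 on [0, 1/2]."""
    target = h(0.25) - 0.5
    lo, hi = 0.0, 0.5          # 1 - h(p) decreases from 1 to 0 on this interval
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if 1.0 - h(mid) > target:
            lo = mid           # BSC capacity still too large: move right
        else:
            hi = mid
    return 0.5 * (lo + hi)

p_max = solve_p_max()
print(round(p_max, 3))         # prints 0.184, the value quoted in the text
```

Bisection is used because $1 - h(p)$ is monotone on $[0, \frac{1}{2}]$, so the root is unique there.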



A. Evaluation of the inner bound, $0 \le p \le p_{\max}$

In the region $0 \le p \le p_{\max}$ it is straightforward to see that
the inner bound reduces to the following region (obtained via
time-division between the two auxiliary channels (i) $U = \emptyset$
and (ii) $U = X$, in each case setting $P(X = 0) = 0.5$),

$$R_0 + R_1 \le h(\tfrac{1}{4}) - \tfrac{1}{2},$$

which clearly matches the outer bound (Bound 2). Thus for
$0 \le p \le p_{\max} \approx 0.184$, the inner and outer bounds are tight
and give the capacity region.

B. Evaluation of the inner bound, $p_{\max} \le p \le \frac{1}{2}$

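Let $\mathcal{U} = \{1, 2, \ldots, m\}$ and let $P(U = i) = u_i$ and
$P(X = 0|U = i) = s_i$. Further, let

$$h(x) = -x \log_2 x - (1 - x) \log_2 (1 - x)$$

denote the binary entropy function.
Using this notation we have

$$I(U; Y_3) = h\Big(\sum_i u_i (s_i(1 - p) + (1 - s_i)p)\Big) - \sum_i u_i h(s_i(1 - p) + (1 - s_i)p)$$
$$I(X; Y_1|U) = \sum_i u_i h\big(\tfrac{s_i}{2}\big) - \sum_i u_i s_i$$
$$I(X; Y_2|U) = \sum_i u_i h\big(\tfrac{1 - s_i}{2}\big) - \sum_i u_i (1 - s_i)$$
$$I(X; Y_1) = h\Big(\sum_i \tfrac{u_i s_i}{2}\Big) - \sum_i u_i s_i$$
$$I(X; Y_2) = h\Big(\sum_i \tfrac{u_i (1 - s_i)}{2}\Big) - \sum_i u_i (1 - s_i).$$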
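These closed forms can be checked against a direct computation from joint distributions. The sketch below assumes a specific labeling of the BSSC transition probabilities (`W1`, `W2`), inferred from the expressions above because the channel figure is not reproduced in the text; the helper names and the test values of `u`, `s`, `p` are illustrative.

```python
import math

def h(x):
    """Binary entropy in bits, with h(0) = h(1) = 0."""
    if x <= 0.0 or x >= 1.0:
        return 0.0
    return -x * math.log2(x) - (1 - x) * math.log2(1 - x)

def mutual_information(joint):
    """I(A;B) in bits from a joint pmf given as {(a, b): probability}."""
    pa, pb = {}, {}
    for (a, b), q in joint.items():
        pa[a] = pa.get(a, 0.0) + q
        pb[b] = pb.get(b, 0.0) + q
    return sum(q * math.log2(q / (pa[a] * pb[b]))
               for (a, b), q in joint.items() if q > 0)

def through(px, W):
    """Joint pmf of (X, Y) for input pmf px[x] and channel W[x][y]."""
    return {(x, y): px[x] * W[x][y] for x in (0, 1) for y in (0, 1)}

# ASSUMED labeling, W[x][y] = P(Y = y | X = x): X = 0 is the noisy input
# for Y1 and X = 1 is the noisy input for Y2.
W1 = [[0.5, 0.5], [0.0, 1.0]]
W2 = [[1.0, 0.0], [0.5, 0.5]]

def direct(u, s, p):
    """The five mutual informations computed from joint distributions."""
    W3 = [[1 - p, p], [p, 1 - p]]                     # BSC(p)
    sbar = sum(ui * si for ui, si in zip(u, s))       # P(X = 0)
    joint_u_y3 = {(i, y): ui * (si * W3[0][y] + (1 - si) * W3[1][y])
                  for i, (ui, si) in enumerate(zip(u, s)) for y in (0, 1)}
    cond = lambda W: sum(ui * mutual_information(through([si, 1 - si], W))
                         for ui, si in zip(u, s))
    marg = lambda W: mutual_information(through([sbar, 1 - sbar], W))
    return [mutual_information(joint_u_y3), cond(W1), cond(W2), marg(W1), marg(W2)]

def closed_form(u, s, p):
    """The same five quantities from the expressions in the text."""
    conv = [si * (1 - p) + (1 - si) * p for si in s]  # P(Y3 = 0 | U = i)
    sbar = sum(ui * si for ui, si in zip(u, s))
    return [
        h(sum(ui * c for ui, c in zip(u, conv))) - sum(ui * h(c) for ui, c in zip(u, conv)),
        sum(ui * (h(si / 2) - si) for ui, si in zip(u, s)),
        sum(ui * (h((1 - si) / 2) - (1 - si)) for ui, si in zip(u, s)),
        h(sbar / 2) - sbar,
        h((1 - sbar) / 2) - (1 - sbar),
    ]

u, s, p = [0.3, 0.7], [0.2, 0.45], 0.25               # illustrative values
max_gap = max(abs(a - b) for a, b in zip(direct(u, s, p), closed_form(u, s, p)))
print(max_gap)                                        # agreement up to rounding
```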

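Define $\tilde{\mathcal{U}} = \{1, 2, \ldots, m\} \times \{1, 2\}$, $P(\tilde{U} = (i, 1)) = \frac{u_i}{2}$,
$P(\tilde{X} = 0|\tilde{U} = (i, 1)) = s_i$, $P(\tilde{U} = (i, 2)) = \frac{u_i}{2}$, and
$P(\tilde{X} = 0|\tilde{U} = (i, 2)) = 1 - s_i$. This induces an $\tilde{X}$ with
$P(\tilde{X} = 0) = \frac{1}{2}$. It is straightforward to see the following:

$$I(\tilde{U}; Y_3) \ge I(U; Y_3)$$
$$I(\tilde{X}; Y_1|\tilde{U}) = I(\tilde{X}; Y_2|\tilde{U}) = \tfrac{1}{2}\big(I(X; Y_1|U) + I(X; Y_2|U)\big) \ge \min\{I(X; Y_1|U), I(X; Y_2|U)\}$$
$$I(\tilde{X}; Y_1) = I(\tilde{X}; Y_2) \ge \tfrac{1}{2}\big(I(X; Y_1) + I(X; Y_2)\big)$$

From this it follows that for every $U$, replacing $U$ by $\tilde{U}$
leads to a larger achievable region. Hence to evaluate Bound
1 it suffices to maximize over all auxiliary random variables
of the form $\tilde{U}$ defined by: $\tilde{\mathcal{U}} = \{1, 2, \ldots, m\} \times \{1, 2\}$,
$P(\tilde{U} = (i, 1)) = \frac{u_i}{2}$, $P(\tilde{X} = 0|\tilde{U} = (i, 1)) = s_i$,
$P(\tilde{U} = (i, 2)) = \frac{u_i}{2}$, and $P(\tilde{X} = 0|\tilde{U} = (i, 2)) = 1 - s_i$.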
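The three relations obtained from the symmetrization can be spot-checked on random auxiliary variables. This is a sketch under the closed-form expressions of this section; `terms`, `symmetrize` and `check` are hypothetical helper names, and the random sampling scheme is an arbitrary choice.

```python
import math
import random

def h(x):
    """Binary entropy in bits, with h(0) = h(1) = 0."""
    if x <= 0.0 or x >= 1.0:
        return 0.0
    return -x * math.log2(x) - (1 - x) * math.log2(1 - x)

def terms(u, s, p):
    """(I(U;Y3), I(X;Y1|U), I(X;Y2|U), I(X;Y1), I(X;Y2)) from the closed forms."""
    conv = [si * (1 - p) + (1 - si) * p for si in s]
    sbar = sum(ui * si for ui, si in zip(u, s))
    i_u_y3 = h(sum(ui * c for ui, c in zip(u, conv))) - sum(ui * h(c) for ui, c in zip(u, conv))
    i_1_u = sum(ui * (h(si / 2) - si) for ui, si in zip(u, s))
    i_2_u = sum(ui * (h((1 - si) / 2) - (1 - si)) for ui, si in zip(u, s))
    return i_u_y3, i_1_u, i_2_u, h(sbar / 2) - sbar, h((1 - sbar) / 2) - (1 - sbar)

def symmetrize(u, s):
    """Split each mass u_i in half: one copy keeps s_i, the other uses 1 - s_i."""
    return [ui / 2 for ui in u] * 2, list(s) + [1 - si for si in s]

def check(u, s, p, tol=1e-9):
    a3, a1u, a2u, a1, a2 = terms(u, s, p)
    b3, b1u, b2u, b1, b2 = terms(*symmetrize(u, s), p)
    return (b3 >= a3 - tol
            and abs(b1u - b2u) < tol
            and abs(b1u - 0.5 * (a1u + a2u)) < tol
            and b1u >= min(a1u, a2u) - tol
            and abs(b1 - b2) < tol
            and b1 >= 0.5 * (a1 + a2) - tol)

random.seed(0)
results = []
for _ in range(1000):
    m = random.randint(1, 5)
    w = [random.random() + 1e-3 for _ in range(m)]
    u = [x / sum(w) for x in w]                 # random pmf on {1, ..., m}
    s = [random.random() for _ in range(m)]
    results.append(check(u, s, random.uniform(0.0, 0.5)))
all_ok = all(results)
print(all_ok)
```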



Under this notation we have the following expression for
the rate region given in Bound 1:

$$R_0 \le I(U; Y_3) = h(\tfrac{1}{2}) - \sum_i u_i h(s_i(1 - p) + (1 - s_i)p),$$
$$R_1 \le \min\{I(X; Y_1|U), I(X; Y_2|U)\},$$
$$R_0 + R_1 \le \min\{I(X; Y_1), I(X; Y_2)\}.$$


Using the symmetry $h(x) = h(1 - x)$ we note
that

$$h(s_i(1 - p) + (1 - s_i)p) = h((1 - s_i)(1 - p) + s_i p)$$

and thus the above region is invariant under the transformation
$s_i \to 1 - s_i$, implying we can restrict $s_i$ to take values only
in $0 \le s_i \le \frac{1}{2}$.

Before we proceed to determine the boundary of this region, 
we prove the following lemma. 

C. An inequality for a class of functions 

Lemma 1: Let $f(x)$ and $g(x)$ be two non-negative and
strictly increasing functions that are differentiable in the region
$x \in [x_1, x_2]$. Further assume that $\frac{f^{(1)}(x)}{g^{(1)}(x)}$ is a decreasing
function, where $f^{(1)}(x)$ and $g^{(1)}(x)$ denote the derivatives of
the functions. Given any $u$, $0 \le u \le 1$, let $x_{int}$ be uniquely
defined according to $f(x_{int}) = u f(x_1) + (1 - u) f(x_2)$. Then
the following holds:

$$g(x_{int}) \le u g(x_1) + (1 - u) g(x_2).$$

Proof: We have $u(f(x_{int}) - f(x_1)) = (1 - u)(f(x_2) -
f(x_{int}))$, and we wish to show that $u(g(x_{int}) - g(x_1)) \le
(1 - u)(g(x_2) - g(x_{int}))$. Since all the terms are positive, this
reduces to showing

$$\frac{f(x_{int}) - f(x_1)}{g(x_{int}) - g(x_1)} \ge \frac{f(x_2) - f(x_{int})}{g(x_2) - g(x_{int})}.$$

However, this is immediate as shown below.

From the fact that $\frac{f^{(1)}(x)}{g^{(1)}(x)}$ is a decreasing function, we have

$$\frac{\int_{x_1}^{x_{int}} f^{(1)}(x)\,dx}{\int_{x_1}^{x_{int}} g^{(1)}(x)\,dx} \ge \frac{f^{(1)}(x_{int})}{g^{(1)}(x_{int})} \ge \frac{\int_{x_{int}}^{x_2} f^{(1)}(x)\,dx}{\int_{x_{int}}^{x_2} g^{(1)}(x)\,dx}.$$

■
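The integral inequality can be unpacked in one more step. The following sketch assumes, in addition to the hypotheses of Lemma 1, that $g^{(1)}(x) > 0$ on the interval so that the ratio is well defined; the constant $c$ is shorthand introduced here only for illustration.

```latex
\begin{align*}
% c is the value of the ratio at x_int (illustrative shorthand)
c &= \frac{f^{(1)}(x_{int})}{g^{(1)}(x_{int})}, \\
% the ratio is decreasing, so it is at least c to the left of x_int ...
f^{(1)}(x) &\ge c\, g^{(1)}(x) \quad \text{for } x \in [x_1, x_{int}], \\
% ... and at most c to the right of x_int
f^{(1)}(x) &\le c\, g^{(1)}(x) \quad \text{for } x \in [x_{int}, x_2], \\
% integrating each inequality over its interval
f(x_{int}) - f(x_1) &\ge c\,\bigl(g(x_{int}) - g(x_1)\bigr), \\
f(x_2) - f(x_{int}) &\le c\,\bigl(g(x_2) - g(x_{int})\bigr), \\
% dividing by the positive increments of g
\frac{f(x_{int}) - f(x_1)}{g(x_{int}) - g(x_1)} &\ge c \ge
\frac{f(x_2) - f(x_{int})}{g(x_2) - g(x_{int})}.
\end{align*}
```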

Repeated application of Lemma 1 leads to the following
corollary, potentially of independent interest.

Corollary 1: Let $f(x)$ and $g(x)$ be two non-negative and
strictly increasing functions that are differentiable in the region
$x \in [x_1, x_2]$. Further assume that $\frac{f^{(1)}(x)}{g^{(1)}(x)}$ is a decreasing
function, where as before $f^{(1)}(x)$ and $g^{(1)}(x)$ denote the
derivatives of the functions. Given any $m > 0$, $u_i \ge 0$ with $\sum_i u_i = 1$,
and $y_i \in [x_1, x_2]$, let $x_{int}$ be uniquely defined according to
$f(x_{int}) = \sum_i u_i f(y_i)$. Then the following holds:

$$g(x_{int}) \le \sum_i u_i g(y_i).$$

D. Determining the boundary rate pairs 

We use Corollary 1 to determine the boundary of the
region. We make the following identifications: let $f(x) =
h(\frac{x}{2}) + h(\frac{1 - x}{2}) - 1$ and $g(x) = h(x(1 - p) + (1 - x)p)$. Observe
that $f(x)$ and $g(x)$ are increasing differentiable functions in
the region $[0, \frac{1}{2}]$.

Claim 1: For $\frac{1}{6} \le p \le \frac{1}{2}$ the ratio of the derivatives
$\frac{f^{(1)}(x)}{g^{(1)}(x)}$ is a decreasing function.

The proof of this fact is found in the Appendix.

(Numerical simulations indicate that this is true for $p_{\min} \le
p \le \frac{1}{2}$ with $p_{\min} \approx 0.05$, but for the purposes of establishing the
inner bound this region of $p$ clearly suffices, as $\frac{1}{6} < p_{\max} \approx
0.184$.)
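Claim 1 can be probed numerically on a grid. The sketch below uses the derivatives $f^{(1)}(x) = \frac{1}{2}(J(\frac{x}{2}) - J(\frac{1-x}{2}))$ and $g^{(1)}(x) = (1 - 2p) J(x(1-p) + (1-x)p)$ with $J(y) = \log_2 \frac{1-y}{y}$; the grid, tolerance and the sampled values of $p$ are arbitrary choices, and a grid check is of course not a proof.

```python
import math

def J(y):
    """Derivative of the binary entropy in bits: log2((1 - y) / y)."""
    return math.log2((1 - y) / y)

def ratio(x, p):
    """f'(x) / g'(x) for f(x) = h(x/2) + h((1-x)/2) - 1 and g(x) = h(x(1-p) + (1-x)p)."""
    f1 = 0.5 * (J(x / 2) - J((1 - x) / 2))
    g1 = (1 - 2 * p) * J(x * (1 - p) + (1 - x) * p)
    return f1 / g1

def is_decreasing(p, n=49):
    """Check monotonicity of the ratio on the grid x = 0.01, 0.02, ..., 0.49."""
    vals = [ratio((k + 1) / 100, p) for k in range(n)]
    return all(b <= a + 1e-12 for a, b in zip(vals, vals[1:]))

checks = {p: is_decreasing(p) for p in (1 / 6, 0.2, 0.25, 0.4)}
print(checks)
```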

Remark 2: By combining Claim 1 and Corollary 1, note that
$h(p * f^{-1}(y))$ is convex in $y$, where $a * b = a(1 - b) + b(1 - a)$;
this is very similar to Mrs. Gerber's Lemma [5].

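Now let $s_{int}$ be defined according to

$$h\big(\tfrac{s_{int}}{2}\big) + h\big(\tfrac{1 - s_{int}}{2}\big) = \sum_i u_i \Big(h\big(\tfrac{s_i}{2}\big) + h\big(\tfrac{1 - s_i}{2}\big)\Big).$$

Then from Corollary 1, for $p_{\min} \le p \le \frac{1}{2}$ we have

$$h(\tfrac{1}{2}) - \sum_i u_i h(s_i(1 - p) + (1 - s_i)p) \le h(\tfrac{1}{2}) - h(s_{int}(1 - p) + (1 - s_{int})p).$$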
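The replacement of a mixture $\{(u_i, s_i)\}$ by the single value $s_{int}$ can be illustrated numerically. This is a sketch with arbitrary example values; `solve_s_int` is a hypothetical helper that inverts $f$ by bisection on $[0, \frac{1}{2}]$, where $f$ is increasing.

```python
import math

def h(x):
    """Binary entropy in bits, with h(0) = h(1) = 0."""
    if x <= 0.0 or x >= 1.0:
        return 0.0
    return -x * math.log2(x) - (1 - x) * math.log2(1 - x)

def f(x):
    return h(x / 2) + h((1 - x) / 2) - 1

def g(x, p):
    return h(x * (1 - p) + (1 - x) * p)

def solve_s_int(u, s):
    """Solve f(s_int) = sum_i u_i f(s_i) by bisection (all s_i assumed in [0, 1/2])."""
    target = sum(ui * f(si) for ui, si in zip(u, s))
    lo, hi = 0.0, 0.5
    for _ in range(100):
        mid = 0.5 * (lo + hi)
        if f(mid) < target:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

u, s, p = [0.2, 0.5, 0.3], [0.05, 0.3, 0.45], 0.25    # illustrative values
s_int = solve_s_int(u, s)
r0_mixture = 1 - sum(ui * g(si, p) for ui, si in zip(u, s))
r0_single = 1 - g(s_int, p)
# Same value of f (hence the same R1 term), but the single point gives
# an R0 that is at least as large as the mixture's.
print(s_int, r0_mixture, r0_single)
```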

This implies that the optimal auxiliary channel $U \to X$ is
a BSC with a crossover probability $s$ and $P(U = 0) = \frac{1}{2}$.
Thus for $p_{\max} \le p \le \frac{1}{2}$, the boundary is characterized by the
pairs of points of the form

$$R_0 = 1 - h(s(1 - p) + (1 - s)p),$$
$$R_1 = \min\Big\{\tfrac{1}{2}\big(h(\tfrac{s}{2}) + h(\tfrac{1 - s}{2}) - 1\big),\; h(\tfrac{1}{4}) - \tfrac{3}{2} + h(s(1 - p) + (1 - s)p)\Big\}, \quad (1)$$

for $0 \le s \le \frac{1}{2}$. The second term in $R_1$ comes from taking
into account the sum rate constraint,

$$R_0 + R_1 \le h(\tfrac{1}{4}) - \tfrac{1}{2}.$$

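A simple calculation shows that for $p_0 \le p \le \frac{1}{2}$ one can
ignore the sum rate constraint, where $p_0 = \frac{\sqrt{3} - 1}{2\sqrt{3}} \approx 0.211$.
This $p_0$ corresponds to the smallest value of $p$ where the
convex region characterized by the pairs

$$R_0 = 1 - h(s(1 - p) + (1 - s)p),$$
$$R_1 = \tfrac{1}{2}\big(h(\tfrac{s}{2}) + h(\tfrac{1 - s}{2}) - 1\big)$$

has a slope of $-1$ at the point $(R_0, R_1) = (0, h(\frac{1}{4}) - \frac{1}{2})$.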
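The threshold can be recovered by a short calculation. The following is a sketch of one way to obtain it (not necessarily the calculation the authors had in mind), using L'Hôpital's rule at $s = \frac{1}{2}$, where both first derivatives vanish; the notation $s * p = s(1 - p) + (1 - s)p$ is used for brevity.

```latex
\begin{align*}
R_0(s) &= 1 - h(s * p), \qquad
R_1(s) = \tfrac{1}{2}\bigl(h(\tfrac{s}{2}) + h(\tfrac{1-s}{2}) - 1\bigr), \\
h^{(2)}(y) &= -\frac{1}{y(1-y)\ln 2}, \\
% second derivatives at s = 1/2
R_1^{(2)}(\tfrac{1}{2}) &= \tfrac{1}{4}\, h^{(2)}(\tfrac{1}{4}) = -\frac{4}{3 \ln 2}, \qquad
R_0^{(2)}(\tfrac{1}{2}) = -(1-2p)^2\, h^{(2)}(\tfrac{1}{2}) = \frac{4(1-2p)^2}{\ln 2}, \\
% slope of the curve at (R_0, R_1) = (0, h(1/4) - 1/2)
\frac{dR_1}{dR_0}\Big|_{s = 1/2} &= \frac{R_1^{(2)}(\tfrac{1}{2})}{R_0^{(2)}(\tfrac{1}{2})}
 = -\frac{1}{3(1-2p)^2}, \\
% setting the slope equal to -1
3(1-2p)^2 = 1 \;&\Longrightarrow\; p = \tfrac{1}{2}\bigl(1 - \tfrac{1}{\sqrt{3}}\bigr)
 = \frac{\sqrt{3}-1}{2\sqrt{3}} \approx 0.211.
\end{align*}
```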
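Therefore the inner bound has three different expressions:

• $0 \le p \le p_{\max}$: the inner bound reduces to $R_0 + R_1 \le
h(\frac{1}{4}) - \frac{1}{2}$,

• $p_{\max} \le p \le p_0$: the inner bound is given by equation (1)
where all inequalities are necessary,

• $p_0 \le p \le \frac{1}{2}$: the inner bound is characterized by pairs of
points of the form

$$R_0 = 1 - h(s(1 - p) + (1 - s)p), \qquad R_1 = \tfrac{1}{2}\big(h(\tfrac{s}{2}) + h(\tfrac{1 - s}{2}) - 1\big).$$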

E. Comparison with the outer bound 

To show that the outer bound gives a larger region, we
produce a particular choice of the triple $(U_1, U_2, X)$. Consider
$U_1, U_2$ defined as follows,

$$P(U_1 = 1) = P(U_2 = 1) = u,$$
$$P(U_1 = 2) = P(U_2 = 2) = 1 - u,$$
$$P(X = 0|U_1 = 1) = P(X = 1|U_2 = 1) = 1,$$
$$P(X = 0|U_1 = 2) = P(X = 1|U_2 = 2) = s,$$

where $s = \frac{0.5 - u}{1 - u}$ for $0 \le u \le 0.5$. Existence of the triple
$(U_1, U_2, X)$ is guaranteed by the consistent distribution on $X$.
Substituting this choice into Bound 2 we obtain Region A given
by

$$R_0 \le 1 - (1 - u) h(s(1 - p) + (1 - s)p) - u h(p),$$
$$R_1 \le (1 - u) h(\tfrac{s}{2}) - \tfrac{1}{2} + u,$$
$$R_0 + R_1 \le h(\tfrac{1}{4}) - \tfrac{1}{2}.$$
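The gap between Region A and the inner bound can be seen at a single point. The sketch below picks $p = 0.25$ and $u = 0.25$ purely for illustration and compares the Region A corner point with the inner-bound boundary curve at the same $R_0$; the function names are hypothetical, and the boundary formula used is the one valid when the sum rate constraint is inactive.

```python
import math

def h(x):
    """Binary entropy in bits, with h(0) = h(1) = 0."""
    if x <= 0.0 or x >= 1.0:
        return 0.0
    return -x * math.log2(x) - (1 - x) * math.log2(1 - x)

def conv(s, p):
    """Binary convolution s * p = s(1 - p) + (1 - s)p."""
    return s * (1 - p) + (1 - s) * p

def inner_r1(r0, p):
    """R1 on the inner-bound boundary curve at a given R0 (sum rate constraint inactive)."""
    lo, hi = 0.0, 0.5                 # R0(s) = 1 - h(s * p) decreases as s grows
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if 1 - h(conv(mid, p)) > r0:
            lo = mid
        else:
            hi = mid
    s = 0.5 * (lo + hi)
    return 0.5 * (h(s / 2) + h((1 - s) / 2) - 1)

def region_a_corner(u, p):
    """Corner point of Region A for a given u, with s = (0.5 - u) / (1 - u)."""
    s = (0.5 - u) / (1 - u)
    r0 = 1 - (1 - u) * h(conv(s, p)) - u * h(p)
    r1 = min((1 - u) * h(s / 2) - 0.5 + u, h(0.25) - 0.5 - r0)
    return r0, r1

p = 0.25                              # illustrative choice, above p_0
r0, r1 = region_a_corner(0.25, p)
gap = r1 - inner_r1(r0, p)            # positive: Region A reaches beyond the inner bound
print(r0, r1, gap)
```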

Figure 2 plots Region A and Bound 1 for $p = \frac{1}{4}$. Observe
that Region A is larger than Bound 1 and hence Bounds 1
and 2 do not match for the 3-receiver channel shown in Figure
1. This implies the following corollary.




Fig. 2. Comparing Bound 1 and Region A for $p = \frac{1}{4}$

Corollary 2: There exists a class of channels, given in
Figure 1, for which the inner and outer bounds (i.e. Bounds 1
and 2) do not match.



III. Revisiting outer bound 

We now show that the inner bound is tight for the channel
shown in Figure 1.

Let $\pi : \{0, 1\} \to \{0, 1\}$; $\pi(0) = 1$, $\pi(1) = 0$.

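Consider an $\epsilon$-codebook $\{x^n_{m_0, m_1}, 1 \le m_0 \le 2^{nR_0}, 1 \le
m_1 \le 2^{nR_1}; A_{m_0, m_1} \subseteq \mathcal{Y}_1^n, B_{m_0, m_1} \subseteq \mathcal{Y}_2^n, C_{m_0} \subseteq \mathcal{Y}_3^n\}$,
where the disjoint sets $A_{m_0, m_1}, B_{m_0, m_1}, C_{m_0}$ represent the
decoding maps. From the skew symmetry of the channels
$X \to (Y_1, Y_2)$ and the symmetry in channel $X \to Y_3$, it
is clear that $\{\pi(x^n_{m_0, m_1}), 1 \le m_0 \le 2^{nR_0}, 1 \le m_1 \le
2^{nR_1}; \pi(B_{m_0, m_1}) \subseteq \mathcal{Y}_1^n, \pi(A_{m_0, m_1}) \subseteq \mathcal{Y}_2^n, \pi(C_{m_0}) \subseteq \mathcal{Y}_3^n\}$
represents a valid $\epsilon$-codebook as well.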

From these two codes, construct a new codebook (with error
bounded by $\frac{1}{2} + \epsilon$) of size $2^{nR_0} \times 2^{nR_1 + 1}$ as follows: The
codewords are indexed by $x^n_{m_0, (m_1, b)}$, where $b = 0, 1$. When
$b = 0$ the codeword $x^n_{m_0, (m_1, b=0)} = x^n_{m_0, m_1}$ and when $b =
1$, we have $x^n_{m_0, (m_1, b=1)} = \pi(x^n_{m_0, m_1})$. The decoding maps
for this codebook are created as follows: If $y_1^n \in A_{m_0^1, m_1^1} \cap
\pi(B_{m_0^2, m_1^2})$ then the receiver chooses one of the two message
pairs $(m_0^1, m_1^1)$, $(m_0^2, m_1^2)$ with equal probability. Otherwise it
picks the message pair corresponding to the unique set $A_{m_0^1, m_1^1}$
or $\pi(B_{m_0^2, m_1^2})$ that it belongs to. A similar decoding strategy
applies for receivers $Y_2$ and $Y_3$ as well.
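The codeword part of this construction is easy to sketch. The toy codebook below is a hypothetical example with $n = 4$ (it is not claimed to be a good code), and `flip` and `symmetrize_codebook` are illustrative names; the decoding regions are not modeled.

```python
def flip(word):
    """Apply pi coordinate-wise: pi(0) = 1, pi(1) = 0."""
    return tuple(1 - bit for bit in word)

def symmetrize_codebook(codebook):
    """Map {(m0, m1): x} to {(m0, (m1, b)): x if b == 0 else pi(x)}."""
    new = {}
    for (m0, m1), x in codebook.items():
        new[(m0, (m1, 0))] = tuple(x)   # original codeword
        new[(m0, (m1, 1))] = flip(x)    # its image under pi, same m0
    return new

# Hypothetical toy codebook with two values each for m0 and m1.
toy = {
    (0, 0): (0, 0, 0, 0),
    (0, 1): (0, 0, 1, 1),
    (1, 0): (0, 1, 0, 1),
    (1, 1): (0, 1, 1, 0),
}
sym = symmetrize_codebook(toy)

# For each m0, the set of codewords is closed under pi.
closed = all(
    {flip(x) for (a, _), x in sym.items() if a == m0}
    == {x for (a, _), x in sym.items() if a == m0}
    for m0 in (0, 1)
)
print(len(sym), closed)
```

Doubling the index set through the extra bit $b$ is what makes the codebook symmetric while keeping $M_0$ unchanged, which is the property used in the argument that follows.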

The key feature is the symmetry of the codebook: if $x^n \in \mathcal{C}$
then $\pi(x^n) \in \mathcal{C}$, and both correspond to the same message $M_0$.

Now observe that $H(M_0, M_1|Y_1^n) \le H(M_0, M_1, b|Y_1^n) \le
1 + H(M_0, M_1|Y_1^n, b) = 1 + n(R_0 + R_1)\epsilon_n$. Therefore we
obtain the same outer bound (Bound 2) using Fano's inequality
and identification of the auxiliary random variables as before.

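In particular, the identifications of the auxiliary random
variables remain the following: $U_{1i} = (M_0, Y_3^{i-1}, Y_{1,i+1}^n)$
and $U_{2i} = (M_0, Y_3^{i-1}, Y_{2,i+1}^n)$. Now for the skew-symmetric
channels and a symmetric codebook observe that

$$P(M_0 = m_0, Y_3^{i-1} = y_3^{i-1}, Y_{1,i+1}^n = y_{1,i+1}^n, X_i = x_i)$$
$$= \sum_{x_1^n \setminus x_i} P(M_0 = m_0, X_1^n = x_1^n, Y_3^{i-1} = y_3^{i-1}, Y_{1,i+1}^n = y_{1,i+1}^n)$$
$$\overset{(a)}{=} \sum_{x_1^n \setminus x_i} P(M_0 = m_0, X_1^n = x_1^n) \prod_{j=1}^{i-1} P(Y_{3j} = y_{3j}|X_j = x_j) \times \prod_{k=i+1}^{n} P(Y_{1k} = y_{1k}|X_k = x_k)$$
$$\overset{(b)}{=} \sum_{x_1^n \setminus x_i} P(M_0 = m_0, X_1^n = \pi(x_1^n)) \prod_{j=1}^{i-1} P(Y_{3j} = \pi(y_{3j})|X_j = \pi(x_j)) \times \prod_{k=i+1}^{n} P(Y_{2k} = \pi(y_{1k})|X_k = \pi(x_k))$$
$$= \sum_{x_1^n \setminus x_i} P(M_0 = m_0, X_1^n = \pi(x_1^n), Y_3^{i-1} = \pi(y_3^{i-1}), Y_{2,i+1}^n = \pi(y_{1,i+1}^n))$$
$$\overset{(c)}{=} P(M_0 = m_0, Y_3^{i-1} = \pi(y_3^{i-1}), Y_{2,i+1}^n = \pi(y_{1,i+1}^n), X_i = \pi(x_i)).$$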

Here (a) follows from the discrete memoryless property of
the channel; (b) follows from (i) symmetry of the code,
(ii) symmetry of the channel $X \to Y_3$ with respect to $\pi(\cdot)$,
and (iii) the skew symmetry between receivers $Y_1, Y_2$, i.e.

$$P(Y_2 = \pi(y)|X = \pi(x)) = P(Y_1 = y|X = x);$$

and (c) is a consequence of $\pi(\cdot)$ being a bijection.
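The two channel symmetries used in step (b) can be verified directly on the transition matrices. The BSSC matrices `W1` and `W2` below use an assumed output labeling (the channel figure is not reproduced in the text), chosen to be consistent with the mutual information expressions of Section II; `p = 0.25` is an arbitrary example value.

```python
# ASSUMED labeling, W[x][y] = P(Y = y | X = x).
W1 = [[0.5, 0.5], [0.0, 1.0]]     # X -> Y1: input 0 is noisy, input 1 is clean
W2 = [[1.0, 0.0], [0.5, 0.5]]     # X -> Y2: input 0 is clean, input 1 is noisy

def bsc(p):
    """Transition matrix of a binary symmetric channel with crossover p."""
    return [[1 - p, p], [p, 1 - p]]

pi = {0: 1, 1: 0}                 # the relabeling pi(0) = 1, pi(1) = 0

# Skew symmetry: P(Y2 = pi(y) | X = pi(x)) = P(Y1 = y | X = x).
skew_ok = all(W2[pi[x]][pi[y]] == W1[x][y] for x in (0, 1) for y in (0, 1))

# Symmetry of X -> Y3 with respect to pi.
W3 = bsc(0.25)
bsc_ok = all(W3[pi[x]][pi[y]] == W3[x][y] for x in (0, 1) for y in (0, 1))

print(skew_ok, bsc_ok)            # prints True True
```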

Therefore the random variables $(U_1, X)$ and $(U_2, X)$ are
identical up to re-labeling. Since the mutual information and
entropy do not depend on the labeling, it follows that

$$I(U_1; Y_3) = I(U_2; Y_3)$$
$$I(X; Y_2|U_1) = I(X; Y_2|U_2).$$

Remark 3: This technique can be extended to other skew-symmetric
channels as well, i.e. ones for which such a $\pi(\cdot)$
exists.

Therefore we obtain the following revised outer bound.
Bound 3: The union over the set of rate pairs $(R_0, R_1)$
satisfying

$$R_0 \le I(U_1; Y_3)$$
$$R_0 + R_1 \le \min\{I(U_1; Y_3) + I(X; Y_1|U_1),\; I(U_1; Y_3) + I(X; Y_2|U_1)\}$$
$$R_0 + R_1 \le \min\{I(X; Y_1), I(X; Y_2)\}$$

over all possible choices of random variables $(U_1, X)$ such that
$U_1 \to X \to (Y_1, Y_2, Y_3)$ forms a Markov chain constitutes an
outer bound for this channel.

It is straightforward to see (using the boundary points) that
Bound 3 matches the inner bound and forms the capacity
region.
Appendix



A. Proof of Claim 1



In this section we show that when $\frac{1}{6} \le p \le \frac{1}{2}$, the ratio

$$\frac{f^{(1)}(x)}{g^{(1)}(x)}$$

is a decreasing function of $x$, $x \in [0, \frac{1}{2}]$. Recall the
definitions $f(x) = h(\frac{x}{2}) + h(\frac{1 - x}{2}) - 1$ and $g(x) = h(x(1 -
p) + (1 - x)p)$. As $f(x)$ and $g(x)$ are strictly increasing in
$x \in [0, \frac{1}{2}]$, it suffices to show that

$$\frac{f^{(2)}(x)}{f^{(1)}(x)} \le \frac{g^{(2)}(x)}{g^{(1)}(x)}, \quad (2)$$

where $f^{(2)}(x)$, $g^{(2)}(x)$ denote the second derivatives of the
functions.

Let $J(x) = \log \frac{1 - x}{x}$, $U(x) = x(1 - x)$ and $x * p = x(1 -
p) + p(1 - x)$. Using this notation and substituting for the
derivatives, (2) reduces to showing

$$\frac{J(x * p)\, U(x * p)}{1 - 2p} \ge \frac{2\big(J(\frac{x}{2}) - J(\frac{1 - x}{2})\big)}{\frac{1}{U(\frac{x}{2})} + \frac{1}{U(\frac{1 - x}{2})}}. \quad (3)$$

Now observe that as $x \to \frac{1}{2}$ both $J(x * p)$ and $J(\frac{x}{2}) -
J(\frac{1 - x}{2})$ tend to zero and all other terms remain positive. Thus
we have an equality at $x = \frac{1}{2}$. To show the inequality for
$x \in [0, \frac{1}{2}]$ it suffices to prove that the derivative of the left
hand side (L.H.S.) of (3) is smaller than the derivative of the right
hand side (R.H.S.) of (3).

The derivative of the L.H.S. is given by

$$\frac{d}{dx} \frac{J(x * p)\, U(x * p)}{1 - 2p} = -1 + J(x * p)(1 - 2(x * p)).$$

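Let us define $R(x)$ to be the derivative of the R.H.S., i.e.

$$R(x) = \frac{d}{dx} \frac{2\big(J(\frac{x}{2}) - J(\frac{1 - x}{2})\big)}{\frac{1}{U(\frac{x}{2})} + \frac{1}{U(\frac{1 - x}{2})}}.$$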
We wish to show that

$$-1 + J(x * p)(1 - 2(x * p)) \le R(x), \quad (4)$$

for all $\frac{1}{6} \le p \le \frac{1}{2}$ and $x \in [0, \frac{1}{2}]$. Given any $x \in [0, \frac{1}{2}]$,
observe that $J(x * p)(1 - 2(x * p))$ is a decreasing function of
$p$ for $0 \le p \le \frac{1}{2}$. Thus establishing (4) for $p = \frac{1}{6}$ suffices.

Let $S(x) = -1 + J(x * \frac{1}{6})(1 - 2(x * \frac{1}{6}))$. Figure 3 plots
$S(x)$ and $R(x)$.




Fig. 3. Comparing R(x) and S(x) 

Thus we have $S(x) \le R(x)$ for $0 \le x \le \frac{1}{2}$. This completes
the proof of Claim 1.
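Since the comparison of $S(x)$ and $R(x)$ rests on a plot, it can be reproduced numerically. The sketch below takes $J$ with the natural logarithm and uses a closed form for $R(x)$ derived here (not taken from the paper): writing the R.H.S. of (3) as $2N/D$ with $N = J(\frac{x}{2}) - J(\frac{1-x}{2})$ and $D = \frac{1}{U(x/2)} + \frac{1}{U((1-x)/2)}$, one has $N' = -D/2$ and hence $R(x) = -1 - 2 N D' / D^2$. The grid is an arbitrary choice.

```python
import math

def J(y):
    """J(y) = ln((1 - y) / y), natural logarithm."""
    return math.log((1 - y) / y)

def U(y):
    return y * (1 - y)

def S(x, p=1 / 6):
    """Derivative of the L.H.S. of (3) at p = 1/6."""
    c = x * (1 - p) + p * (1 - x)             # x * p
    return -1 + J(c) * (1 - 2 * c)

def R(x):
    """Derivative of the R.H.S. of (3), via R = -1 - 2 N D' / D^2 (derived here)."""
    N = J(x / 2) - J((1 - x) / 2)
    D = 1 / U(x / 2) + 1 / U((1 - x) / 2)
    dD = -0.5 * (1 - x) / U(x / 2) ** 2 + 0.5 * x / U((1 - x) / 2) ** 2
    return -1 - 2 * N * dD / D ** 2

xs = [k / 100 for k in range(1, 50)]          # 0.01, ..., 0.49
gaps = [R(x) - S(x) for x in xs]
print(min(gaps) > 0)                          # R(x) exceeds S(x) on the whole grid
```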



